Decision making with inference and learning methods

نویسندگان

  • Matthew William Hoffman
  • Ajay Jasra
چکیده

In this work we consider probabilistic approaches to sequential decision making. The ultimate goal is to provide methods by which decision making problems can be attacked by approaches and algorithms originally built for probabilistic inference. This in turn allows us to directly apply a wide variety of popular, practical algorithms to these tasks. In Chapter 1 we provide an overview of the general problem of sequential decision making and a broad description of various solution methods. Much of the remaining work of this thesis then proceeds by relying upon probabilistic reinterpretations of the decision making process. This strategy of reducing learning problems to simpler inference tasks has been shown to be very fruitful in much of machine learning, and we expect similar improvements to arise in the control and reinforcement learning fields. The approaches of Chapters 2–3 build upon the framework of [Toussaint and Storkey, 2006] in reformulating the solution of Markov decision processes instead as maximum-likelihood estimation in an equivalent probabilistic model. In Chapter 2 we utilize this framework to construct an Expectation Maximization algorithm for continuous, linear-Gaussian models with mixture-of-Gaussian rewards. This approach extends popular linearquadratic reward models to a much more general setting. We also show how to extend this probabilistic framework to continuous time processes. Chapter 3 further builds upon these methods to introduce a Bayesian approach to policy search using Markov chain Monte Carlo. In Chapter 4 we depart from the setting of direct policy search and instead consider value function estimation. In particular we utilize leastsquares temporal difference learn-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Introduction to Inference and Learning in Bayesian Networks

Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...

متن کامل

Evaluation of the Relationship between Assertiveness, Decision Making Styles and Organizational Learning of Health Managers in Shahrekord University of Medical Sciences in 2018

Background & Aim: Managers are always learning to make critical decisions, and learning organizations emphasize continuous learning to survive in the current competitive environment. This study aimed to evaluate the relationship between assertiveness and decision making styles and organizational learning of health managers of Shahrekord University of Medical Sciences. Materials and Methods: Thi...

متن کامل

تأثیر یادگیری مبتنی بر الگوریتم بر تصمیم‌گیری بالینی دانشجویان فوریتهای پزشکی

Introduction: Improvement of students’ clinical decision making is one of the main challenges in medical education. There are numerous ways to improve these skills. The aim of this study was to examine the effect of algorithm-based learning on clinical decision making abilities of medical emergency students. Method: in this experimental study, twenty five medical emergency students were rand...

متن کامل

Effectiveness of the Self-determination Educational Package on Self-directed Learning and Decision-making Styles among High School Students

Introduction: The purpose of this study was to develop a self-determination educational package and determine its effectiveness on Self-Directed Learning and Decision making Styles of high school students. Methods: The research method was semi-experimental with pre-test, post-test with the control group and follow up. At first, self-determination educational package was compiled using library s...

متن کامل

Influencing factors on integrating professional learning of secondary mathematics teachers with the analysis, interpretation and decision-making of their teaching

The present study is part of a bigger research project and its purpose was to investigate the influencing factors on integrating professional learning of secondary mathematics teachers with the ways in which they analyze, interpret and make decisions regarding their teaching. For the fine-grained analysis of the first layer of data, phenomenography method was used. Nine teachers were interviewe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013